Search CORE

20 research outputs found

TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers

Author: Fogel Sharon
Litman Ron
Nuriel Oren
Publication venue
Publication date: 24/07/2022
Field of study

Leveraging the characteristics of convolutional layers, neural networks are extremely effective for pattern recognition tasks. However in some cases, their decisions are based on unintended information leading to high performance on standard benchmarks but also to a lack of generalization to challenging testing conditions and unintuitive failures. Recent work has termed this "shortcut learning" and addressed its presence in multiple domains. In text recognition, we reveal another such shortcut, whereby recognizers overly depend on local image statistics. Motivated by this, we suggest an approach to regulate the reliance on local statistics that improves text recognition performance. Our method, termed TextAdaIN, creates local distortions in the feature map which prevent the network from overfitting to local statistics. It does so by viewing each feature map as a sequence of elements and deliberately mismatching fine-grained feature statistics between elements in a mini-batch. Despite TextAdaIN's simplicity, extensive experiments show its effectiveness compared to other, more complicated methods. TextAdaIN achieves state-of-the-art results on standard handwritten text recognition benchmarks. It generalizes to multiple architectures and to the domain of scene text recognition. Furthermore, we demonstrate that integrating TextAdaIN improves robustness towards more challenging testing conditions. The official Pytorch implementation can be found at https://github.com/amazon-research/textadain-robust-recognition.Comment: 12 pages, 8 figures, Accepted to ECCV 202

arXiv.org e-Print Archive

Sequence-to-Sequence Contrastive Learning for Text Recognition

Author: Aberdam Aviad
Anschel Oron
Litman Ron
Manmatha R.
Mazor Shai
Perona Pietro
Slossberg Ron
Tsiper Shahar
Publication venue
Publication date: 20/12/2020
Field of study

We propose a framework for sequence-to-sequence contrastive learning (SeqCLR) of visual representations, which we apply to text recognition. To account for the sequence-to-sequence structure, each feature map is divided into different instances over which the contrastive loss is computed. This operation enables us to contrast in a sub-word level, where from each image we extract several positive pairs and multiple negative examples. To yield effective visual representations for text recognition, we further suggest novel augmentation heuristics, different encoder architectures and custom projection heads. Experiments on handwritten text and on scene text show that when a text decoder is trained on the learned representations, our method outperforms non-sequential contrastive methods. In addition, when the amount of supervision is reduced, SeqCLR significantly improves performance compared with supervised training, and when fine-tuned with 100% of the labels, our method achieves state-of-the-art results on standard handwritten text recognition benchmarks

arXiv.org e-Print Archive

Caltech Authors

CLIPTER: Looking at the Bigger Picture in Scene Text Recognition

Author: Aberdam Aviad
Bensaïd David
Ganz Roy
Golts Alona
Litman Ron
Mazor Shai
Nuriel Oren
Tichauer Royee
Publication venue
Publication date: 23/07/2023
Field of study

Reading text in real-world scenarios often requires understanding the context surrounding it, especially when dealing with poor-quality text. However, current scene text recognizers are unaware of the bigger picture as they operate on cropped text images. In this study, we harness the representative capabilities of modern vision-language models, such as CLIP, to provide scene-level information to the crop-based recognizer. We achieve this by fusing a rich representation of the entire image, obtained from the vision-language model, with the recognizer word-level features via a gated cross-attention mechanism. This component gradually shifts to the context-enhanced representation, allowing for stable fine-tuning of a pretrained recognizer. We demonstrate the effectiveness of our model-agnostic framework, CLIPTER (CLIP TExt Recognition), on leading text recognition architectures and achieve state-of-the-art results across multiple benchmarks. Furthermore, our analysis highlights improved robustness to out-of-vocabulary words and enhanced generalization in low-data regimes.Comment: Accepted for publication by ICCV 202

arXiv.org e-Print Archive

Both Intrinsic and Extrinsic Religious Orientation are Positively Associated with Attitudes Toward Cleanliness: Exploring Multiple Routes from Godliness to Cleanliness

Author: Finkelstein Ron
Litman Leib
Robinson Jonathan
Weinberger-Litman Sarah L.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/02/2019
Field of study

The Touro College and University System

European Neuromuscular Centre consensus statement on anaesthesia in patients with neuromuscular disorders

Author: Bastian Boerge
Diaz-Cambronero Oscar
Gillies Robyn
Hellblom Anna
Heytens Luc
Hopkins Philip M.
Johannsen Stephan
Jungbluth Heinz
Klein Andrea
Litman Ron
Lokken Nicoline
Muenster Tino
Reimann Jens
Riazi Sheila
Rueffert Henrik
Silva Helga C.A.
Snoeck Marc M.J.
Tasca Giorgio
van den Bersselaar Luuk R.
Veyckemans Francis
Vilchez Juan Jesus
Voermans Nicol C.
Publication venue
Publication date: 01/01/2022
Field of study

BACKGROUND Patients with neuromuscular conditions are at increased risk of suffering peri-operative complications related to anaesthesia. There is currently little specific anaesthetic guidance concerning these patients. Here we present the European Neuromuscular Centre (ENMC) consensus statement on anaesthesia in patients with neuromuscular disorders as formulated during the 259th ENMC workshop on Anaesthesia in neuromuscular disorders. METHODS International experts in the field of (paediatric) anaesthesia, neurology and genetics were invited to participate in the ENMC workshop. A literature search was conducted in PubMed and EMBASE whose main findings were disseminated to the participants and presented during the workshop. Depending on specific expertise, participants presented the existing evidence and their expert opinion concerning anaesthetic management in six specific groups of myopathies and neuromuscular junction disorders. The consensus statement was prepared according to the Appraisal of Guidelines for REsearch & Evaluation (AGREE II) reporting checklist. The level of evidence has been adapted according to the Scottish Intercollegiate Guidelines Network (SIGN) grading system. The final consensus statement was subjected to a modified Delphi process. RESULTS A set of general recommendations valid for the anaesthetic management of patients with neuromuscular disorders in general have been formulated. Specific recommendations were formulated for 1) neuromuscular junction disorders; 2) muscle channelopathies (non-dystrophic myotonia and periodic paralysis); 3) myotonic dystrophy (type 1 and 2); 4) muscular dystrophies; 5) congenital myopathies and congenital dystrophies and 6) mitochondrial and metabolic myopathies. CONCLUSION This ENMC consensus statement summarizes the most important considerations for planning and performing anaesthesia in patients with neuromuscular disorders

Repositório Institucional UNIFESP

PubMed Central

Institutional Repository Universiteit Antwerpen

King's Research Portal

Bern Open Repository and Information System (BORIS)